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I, JOSEPH R. BYRUM HEREBY DECLARE AS FOLLOWS: 

1 . I am a U.S. citizen and currently reside at 91 3 3 1 st Street, West Des Moines, Iowa. 

2. I have been employed by Asgrow Seed Company, Inc., and Monsanto Company, the parent 
company of Asgrow Seed Company, Inc., since 1995; currently with the title of Director, Soybean 
Trait Integration. 

3. I hold a Ph.D. in Plant Breeding and Genetics from Iowa State University and a B.S. in 

Crop and Soil Sciences from Michigan State University. I have been conducting research in the 

area of plant biochemistry, molecular biology and genetics since 1 989. My duties have included the 
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creation of transgenic crop varieties since 1993. 

4. I am an inventor of the above-captioned patent application and am familiar with the 
contents of the patent application. 

5. I understand that the Patent and Trademark Office Examiner in charge of assessing the 
patentability of the referenced patent application has rejected the claims as being obvious. In 
particular, it is my understanding that it has been asserted that it would have been obvious to 
engineer resistance to the herbicides glyphosate and glufosinate in a single soybean plant. 

6. I am therefore providing the present Declaration to submit information demonstrating that 
it would not have been obvious to those of ordinary skill in the field of agriculture and 
biotechnology at the time the application was filed to produce a single soybean plant exhibiting 
resistance to glyphosate and glufosinate. 

7. As indicated in the application, a soybean variety had not been developed having more 
than one herbicide resistance trait prior to the current invention. Any prior assertions that 
glyphosate and glufosinate herbicide resistance transgenes could be successfully expressed in a 
single variety would therefore be speculation. 

The expression of a herbicide resistance transgene requires manipulation of complex 
metabolic pathways of plant cells. Herbicides such as glyphosate and glufosinate normally 
interfere with these pathways. For example, glyphosate inhibits the enzyme 5- 
enolpyruvylshikimate-3 -phosphate synthase (EPSPS) and glufosinate the phosphinothricin acetyl 
transferase (PAT) enzyme (see Scheme 1 on page 13 of the patent application). These enzymes 
are both involved in the synthesis of amino acids in plant cells. The lack of function of the 
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enzymes kills the plant. Engineering resistance to the herbicides requires modifying and/or 
overexpression of altered forms of the enzymes with decreased herbicide susceptibility. 

Because soybean plants do not naturally exhibit herbicide tolerance, the results of a given 
modification are unpredictable. Transgene expression causes complex pleiotropic effects that 
may or may not be detectable at the phenotypic level. The effects can vary depending upon 
factors such as the location of insertion of the transgene in a genome, the transgene being 
expressed, the genotype of the host soybean plant, and the regulatory elements and any enhancers 
used to express the transgene. The expression of enzymes not normally present in the plants also 
creates a "metabolic drag" reducing energy from the diversion of resources to the expression of 
the transgene. Herbicide resistance traits add the uncertainty of potential interactions among 
interrelated metabolic pathways, including negative or positive feedback regulation of different 
pathways from altered substrate or precursor production. 

The above difficulties are not merely additive when combining more than one herbicide 
resistance transgene. Exponential increases in difficulty arise from the interaction of different 
traits. First, particular combinations of herbicides have a synergistic effect. Tolerance to 
glyphosate and glufosinate individually would therefore not necessarily be expected to be 
indicative of tolerance to the combination of these herbicides. Second, metabolic drag and/or 
pleiotropic effects could have limited the availability of substrates in the metabolic pathways 
necessary for the co-expression of glyphosate and glufosinate tolerance transgenes. Third, 
certain traits are known to be negatively correlated. For example, with a few exceptions, 
increases in protein in soybeans have resulted in decreased oil based on negative genetic 
correlations (see, e.g., p. 1038, Wilcox and Cavins. 1995. Crop Sci. 35:1036-1041; see also p.3, 
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Openshaw and Hadley. 1984. Crop Sci, 24:1-4). Negative correlations have also been observed 
between oil content and yield and protein (see, e.g., p. 126 of Hanson et aL 9 Crop Sci. 1: 121- 
126). Absent evidence to the contrary, a negative correlation could also have been observed for 
the combination of glyphosate and glufosinate tolerance traits. 

As a soybean plant had never previously been engineered expressing any two herbicide 
resistance traits, success with combining glyphosate and glufosinate tolerance simply could not 
have been reasonably expected based on the prior knowledge in the art. Any one or more of the 
difficulties mentioned above could have prevented the co-expression of transgenes conferring 
glyphosate and glufosinate tolerance. Given the multitude of variables, the absence of 
pleiotropic or other effects preventing the combination of these traits would have been 
speculation prior to the studies presented in the application. 

8. Based on the foregoing, one of skill in the art would have been without any reasonable 
expectation at the time the application was filed that introducing glyphosate and glufosinate 
resistance transgenes into a single soybean plant would yield a plant that is resistant to both 
herbicides. 



9. I hereby declare that all statements made herein of my knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements 
were made with the knowledge that willful false statements and the like so made are punishable by 
fine or imprisonment, or both, under Section 1001 of Title 18 of the United States Code and that 
such willful false statements may jeopardize the validity of the application or any patent issued 
thereon. 
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Backcrossing High Seed Protein to a Soybean Cultivar 

James R. Wilcox* and James F. Cavins 



ABSTRACT 

An inverse relationship between seed yield and seed protein concen- 
tration has limited success in developing soybean [Glycine max (L.) 
Merr.) cultivars with high seed protein. High protein from the donor 
parent YanbV (498 g kg' 1 protein) was backcrossed to *Cutler 71* 
(408 g kg -1 protein) to determine if the yield of Cutler 71 could 
be recovered in addition to the high protein from Pando. Random 
F 4 -derived lines, plus three lines with highest seed protein concentra- 
tion, from the initial cross, the Bd, and the BC 2 populations, were 
evaluated for agronomic traits in separate, two-replicate tests for 1 yr 
at West Lafayette, IN. Seed from replication composites were evaluated 
for protein and oil concentration using near infra-red reflectance or 
near infra-red transmission. The parent line for each backcross was 
selected first for high seed protein, then for yield and agronomic 
similarity to Cutler 71. Random F<-derived progenies of the BC 3 
population, the parent line for each backcross, and the cultivars Pando, 
Cutler 71, and Hamilton were evaluated in three-replicate tests for 
2 yr at West Lafayette, IN. In each backcross generation, lines were 
identified with seed protein in excess of 470 g kg" 1 and that progres- 
sively approached the yield of Cutler 71. In the BC 3 population, one 
line averaged 472 g kg ~ 1 seed protein and was significantly (P = 0.05) 
higher in seed yield than Cutler 71, similar in yield to the cultivar 
Hamilton. In each backcross population, there were inverse relation- 
ships between seed yield and seed protein (R 2 values ranging from 
0.33 to 0.06) and between seed protein and seed oil (R 2 values ranging 
from 0.55 in BCi to 0.94 in BCj). In successive backcross populations, 
minimum oil values increased from 148 in BCi to 174 g kg" 1 in BC 3 , 
indicating a trend toward recovering oil concentration (204 g kg' 1 ) 
of Cutler 71. The data demonstrate that high seed protein can be 
backcrossed to a soybean cultivar, fully recovering the seed yield 
of the cultivar, suggesting the absence of physiological barriers to 
combining high seed protein with high seed yield in these populations. 



Success in breeding soybean cultivars with higher- 
than-normal seed protein has been hampered by an 
inverse relationship between seed yield and seed protein 
concentration. Burton (1984), summarizing results of 
several breeding studies, reported genotypic correlations 
between seed yield and seed protein percentage varied 
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from —0. 12 to —0.74. In only one population was there 
a positive genotypic correlation between these two traits. 
Additional studies by Sebern and Lambert (1984), Simp- 
son and Wilcox (1983), and Wehrmann et al. (1987) 
reported moderate to strong inverse relationships be- 
tween seed yield and seed protein with correlation co- 
efficients ranging from -0.23 to -0.86. 

Previous attempts to backcross high seed protein to 
high yielding soybean cultivars and breeding lines have 
not been successful. Hartwig and Hinson (1972) evalu- 
ated BCi and BC2 progenies from crosses between the 
high yielding recurrent parent, 'Bragg', and a high protein 
parent averaging 112% the protein content of Bragg. 
Selection for seed oil, based on nuclear magnetic reso- 
nance, was effective in obtaining lines that differed in 
seed protein because of the negative relationship between 
seed oil and protein. The highest yielding BC2 lines that 
were similar to Bragg in seed yield and were high, 
intermediate, and low in oil averaged only 397, 418, 
and 435 g kg" 1 protein, respectively. 

Cianzio and Fehr (1982) evaluated seed protein and oil 
of F2-derived lines in the F2 generation and BCiFi-derived 
and BC2Fi-derived lines in the F3 generation of crosses 
between the high protein lines Pando and PI 153.269 
and the high yielding cultivars Wells and Wood worth. 
No line from either set of crosses had protein concentra- 
tions as high as those of the high protein donor parent. 
Mean protein percentages and genetic variances of the 
populations decreased with each backcross to the high 
yielding parent. Their results indicated to them that it 
will be difficult to transfer genes for extremely high 
protein levels to cultivars with lower protein, by back- 
crossing to the low protein parent. No yield data were 
recorded on the breeding lines evaluated in this study. 

Wehrmann et al. (1987) evaluated 95 BC2 progenies 
in each of three populations, where the recurrent parents 
were high yielding lines and the donor parent was Pando, 
that averaged 480 g kg" 1 seed protein. In these popula- 
tions, no backcross-derived lines were recovered that 
combined exceptionally high seed protein with the yield 
of the recurrent parent. In each of two populations, the 
highest protein line averaged only 422 and 433 g kg" 1 
protein and did not differ significantly in yield or seed 
oil from the recurrent parent. In the third population, 
the highest protein line averaged 462 g kg" 1 protein 
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but was significantly lower in both yield and seed oil 
concentration than the recurrent parent. 

None of the backcross studies evaluated progenies 
beyond the BC 2 generation. The lack of success in trans- 
ferring exceptionally high seed protein to high yielding 
cultivars by backcrossing casts doubt on the possibility 
of combining these two traits in adapted germplasm or 
cultivars. 

Our study was initiated to determine if, using succes- 
sive backcrosses, the high seed protein concentration of 
Pando could be successfully transferred into a commer- 
cial soybean cultivar, fully recovering the yield of the 
cultivar. This would demonstrate the absence of genetic 
or physiological barriers to breeding high yielding, high 
protein cultivars. The germplasm developed would also 
be useful in breeding high protein cultivars competitive 
in yield with existing cultivars. 

MATERIALS AND METHODS 

The Maturity Group IV cultivar Cutler 71 (Probst et al., 
1971), averaging 410 g kg- 1 protein and 210 g kg" 1 oil on a 
moisture-free basis, was selected as the recurrent parent. When 
the initial cross was made, Cutler 71 was a high yielding 
Maturity Group IV cultivar adapted to production in central 
and southern Indiana (Wilcox et al. , 1976). The Maturity Group 
00 accession Pando with 522 g kg' 1 protein and 155 g kg" 1 oil 
was used as the high protein donor parent (Bernard et al., 
1995). Cuder 71 was crossed with Pando in 1971 and the 
F 2 through F 4 generations advanced by single seed descent. 
Two-hundred-and-three F 4 plants were selected and grown in 
F 5 progeny rows 2 m long and spaced 0.75 m apart. Data 
were recorded on plant maturity and height, individual rows 
were harvested, and protein and oil were determined on a 25-g 
sample of harvested seed. A random sample of 45 F 4 -derived 
lines and three lines with the highest seed protein were evaluated 
in a two-replicate yield test, planted 12 May 1976. Four-row 
plots 5 m long with 0.75 m between rows were planted with 
450 seeds. Plots were end trimmed to 4 m prior to harvest. 
Data were recorded on the center two rows of each plot for 
the characteristics maturity (95% pods brown), plant height 
(centimeters from soil surface to tip of plants), and lodging 
(scored from 1, all plants erect, to 5, all plants prostrate). 
The center two rows of each plot were harvested and seed 
yield recorded in grams per plot. A 25-g sample of seed, 
composited across replications, was analyzed for protein and 
oil concentration on a moisture-free basis. A line was selected 
first, based on high seed protein, then on similarity to the 



recurrent parent for agronomic traits measured. This line was 
used as the parent for the backcross to Cutler 71. 

The first backcross was made in 1977 and the F 2 through 
F 4 generations advanced by single seed descent. One-hundred- 
sixty-six single F 4 plants were selected and grown in 2-m F 5 
progeny rows, spaced 0.75 m apart. Data were recorded as 
on the progeny rows from the initial cross. The F 4 -derived 
lines from the BC, population were evaluated in three tests, 
one consisting of early maturing lines, the second of midseason 
maturing lines, and the third of late maturing lines. Data on 
38 of the F 4 -derived lines of midseason maturity, 35 of which 
represented random selections and three of which were high 
in seed protein concentration, are reported here. The 38 lines 
were evaluated in a two-replicate yield test planted 23 May 
1982. Plot size and shape were identical to that used for 
populations from the initial cross and the same data were 
recorded. As in the initial cross, a line was selected first on 
the basis of high seed protein, then on similarity in seed yield 
and agronomic traits to the recurrent parent. This line was 
used as the parent for the BC 2 cross. 

The BC 2 cross was made in 1983 and the F 2 through F 4 
generations advanced by single seed descent. Two-hundred- 
forty-two F 4 plants were selected and grown in F 5 plant rows 
2 m long and spaced 0.75 m apart. Data were recorded as on 
plant rows in the original and the BC, cross. A random sample 
of 73 BC 2 lines and three lines with the highest seed protein 
were evaluated in a two-replicate yield test planted 17 May 
1988. Plot size and shape were similar to those used to test 
progenies from the original and the BC, cross except within-plot 
row spacing was 0.60 m. Data were recorded as on replicated 
test plots of the original and BC, progenies. As in previous 
generations, a line was selected first on the basis of high seed 
protein, then on similarity of yield and agronomic traits to 
Cutler 71. This line was used as the parent for the BC 3 cross. 

The BC 3 cross was made in the greenhouse during the 
winter, 1987 to 1988. The F 2 through F 4 generations were 
advanced by single seed descent. Two-hundred-thirty-seven 
F 4 plants were selected and grown in F 5 plant rows 2 m 
long. Data were recorded on plant rows as in the previous 
populations. A random sample of 71 BC 3 lines and the three 
lines with the highest seed protein were selected for evaluation 
in replicated yield tests. Also included in the yield tests were 
the donor and the recurrent parents, the lines selected as the 
parent for each backcross, and Hamilton (Nickell et al. , 1990), 
a currendy grown cultivar similar in maturity to Cuder 71 
and the majority of the BC 3 lines. Hamilton was included as 
a currendy grown yield comparison for the BC 3 lines and the 
older cultivar, Cutler 71. The BC 3 lines and check entries 
were evaluated in three-replicate yield tests in 1991 and 1992. 
Plot size and shape were identical to that used for the BC 2 
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kg ha- 1 
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2145 
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t Moisture-free basis. 

t Score: 1 (all plants erect) to 5 (all plants prostrate). 



Pando 
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CX797-21 
CX1038-14 
CX1307-205 
Cutler 71 
Hamilton 
LSD 0.05 



Donor 
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BC 3 selected line 
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Control 
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474 
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466 
472 
408 
402 
7 
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171 
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204 
214 
5 
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progenies. Data were recorded on each plot as in the tests of 
progenies from the original cross and the BCi and BC 2 proge- 
nies. Protein and oil concentrations of the seed were determined 
on a 25-g sample from each replication in each year of these 
tests. 

The F4 generation, the F 5 plant rows, and the replicated 
yield tests of each population were grown on a Chalmers soi] 
(fine-silty, mixed, mesic Typic Haplaquoll) at the Purdue Univ. 
Agron. Res. Center near West Lafayette, IN. Protein and oil 
concentrations were determined at the USDA-ARS National 
Center for Agricultural Utilization Research, Peoria, IL, by 
near infra-red reflectance for the initial and BCi populations and 
by near infra-red transmission for the BC 2 and BC 3 populations. 
Analyses of variance for randomized complete-block designs 



were used to evaluate the data from the replicated yield tests 
of progenies from the original, BCi, BC 2 , and BC 3 populations. 
The F 4 -derived lines were considered random effects in each 
of these analyses. Years were considered random effects in the 
2-yr analyses of the F 4 -derived lines from the BC 3 population. 
Regression analyses were based on progeny means for each 
of the above populations. 

Hanson's (Leffel, 1990) approximate processed value was 
calculated for each parental line, the high-protein BC 3 selection, 
and Hamilton based on 2-yr mean performance data of the 
BC 3 population. Oil and protein prices were the average prices 
for the 1 2-mo period January through December, 1993 (USDA, 
1994). 



500 




400 




160 170 180 190 200 210 



150. 160 170 180 190 



500 





210 



170 



180 



190 



200 



210 



Oil g kg 



Oil g kg 



-1 



Fig. 1. Regression of seed protein concentration on seed oil concentration in successive backcross populations of Cutler 71 (recurrent parent) x 
Pando (donor parent). Note changes in scale of abscissa. Regression values in Table 2. 
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RESULTS AND DISCUSSION 

Data obtained in the 2-yr test that included the donor 
parent, Pando, and the recurrent parent, Cutler 71, were 
consistent with previous reports on the performance of 
these cultivars. Pando averaged 800 kg ha" 1 seed yield 
matured 6 Aug., and averaged 498 g kg- 1 protein and 
148 g kg* 1 oil (Table 1). In contrast, Cutler 71 averaged 
2461 kg ha" 1 seed yield, matured 27 Sept., and averaged 
408 g kg~ ! protein and 204 g kg- 1 oil. 

There was a wide range in seed protein and oil concen- 
trations among F 4 -derived lines in the initial two-parent 
cross and in each of the backcross populations as evalu- 



ated in their respective tests (Fig. 1). This variability 
provided opportunities to select a line in each population 
with high sfeed protein and, in successive backcross popu- 
lations, agronomic characteristics progressively more 
similar to those of Cutler 71. 

With successive backcrosses, rapid progress was made 
in recovering the yield and other agronomic traits of Cutler 
71 while maintaining high seed protein from the donor 
parent, Pando (Table 1). The F 4 -derived lines selected as 
parents from the two-parent cross, CX602-107, and from 
the BC, population, CX797-21, were similar in seed 
yield and seed protein concentration. Yields of both lines 
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were well above the yield of Pando and approached 
the yield of Cutler 71. Among progenies of the BC 2 
population, the line CX 1038- 14 was similar in yield but 
matured 5 d later than Cutler 7 1 . The F 4 -derived BC 3 line 
CX1307-205 averaged significantly (P = 0.05) higher in 
seed yield than Cutler 71, similar in yield to the cultivar 
Hamilton, and averaged 472 g kg" 1 seed protein in these 
tests. Backcross-derived soybean lines that exceed the 
yield of the recurrent parent have been reported in BC 3 
and BC 4 populations and even in BC 7 populations (Wilcox 
et al., 1971). The selected BC 3 line CX1307-205 was 
similar in maturity date and lodging score to Cutler 71 
but averaged 8 cm taller in mature plant height than the 
recurrent parent. 

Hanson et al. (1961) determined energy requirements 
per unit area needed to produce protein, oil, and protein 
plus oil. From their analysis, they concluded there should 
be no restrictions in producing a high yielding, low oil 
soybean and, if N is not limiting or other physiological 
restrictions are not present, a high yielding, high protein 
soybean. The performance of CXI 307-205 demonstrates 
that high yielding, high protein soybean cultivars can be 
a reality and there does not appear to be any physiological 
barrier to their development. 

Results suggest that not all of the genes controlling 
seed protein concentration in Pando were transferred to 
the recurrent parent. The line with the highest seed 
protein from the two-parent cross was not equal to the 
498 g kg" 1 protein concentration in the seed of Pando. 
In successive backcross populations, the high-protein 
selection was significantly lower in seed protein than 
Pando. However, backcrossing was successful in trans- 
ferring seed protein concentrations as high as 472 g kg" 1 
from Pando to BC 3 progenies that were equal to or 
greater in seed yield than Cutler 71. 

Wehrmann et al. (1987) also used Pando as a source 
of high seed protein in attempts to transfer this trait 
to three high yielding soybean cultivars. They were 
successful in recovering the high yields of the recurrent 
parents after two backcrosses but only moderately suc- 
cessful in increasing seed protein concentration. The 
highest yielding BC 2 F 2 -derived lines averaged 373, 376, 
and 427 g kg" 1 protein compared with 361, 362, and 
413 g kg" 1 for their respective recurrent parents. The 
highest protein line in two populations averaged 422 and 
433 g kg" 1 seed protein, considerably below the 480 g 
kg" 1 of the donor parent, Pando. In the third population, 
the highest protein line averaged 462 g kg" 1 protein but 
was significantly lower in yield than the recurrent parent. 

Success in combining 472 g kg" 1 protein with the high 
yield of the recurrent parent in these populations may 
be due in part to delaying selection for seed protein 
concentration until the F 4 generation when plants should 
be fairly homozygous for loci controlling this trait. Wehr- 
mann et al. (1987) based their selections for seed protein 
on evaluation of seed produced on F2 and F 3 plants. 
Heterozygosity at loci controlling seed protein concentra- 
tion in these early generations may have limited effec- 
tiveness of identifying parents that would consistently 
transmit loci controlling high protein concentrations to 
progenies of successive backcrosses. 



Table 2. Regression values for protein concentration (dependent 
variable) on yield and oil concentration (independent variables) 
for progenies from the two-parent cross Cutler 71 x Pando 
and successive backcrosses to Cutler 71. 



Mean squares 

, due to 

No. of 

Progeny lines Regression Error Slope (b) SEt (b) R 1 



Protein (g kg" 1 ) vs. yield (kg ha" 1 ) 

Two-parent 4S 6 328 274 - 0.037 0.0077 0.33 

BCi 38 3 315 191 -0.028 0.0068 0.32 

BQ 76 1 308 263 - 0.028 0.0125 0.06 

BC 3 74 2 281 186 - 0.031 0.0088 0.15 

Protein (g kg" 1 ) vs. ofl (g kg" 1 ) 

Two-parent 48 14 372 99 -1.505 0.1250 0.76 

BC, 38 5 645 126 -1.173 0.1755 0.55 

BC 2 76 17 330 47 -1.622 0.0840 0.83 

BCj 74 14709 13 -1.717 0.0517 0.94 



t SE = standard error. 

The relationships between seed yield and seed protein 
concentration in successive backcross populations are 
illustrated in Fig. 2. The slope of the regression line for 
each population was significantly (P = 0.05) different 
from 0, and there were no differences in slope among 
regression lines for the different populations (Table 2). 
This demonstrates a consistent inverse relationship be- 
tween seed yield and seed protein, even among progenies 
from the third backcross to Cutler 71 . The low R 2 values 
for the BC2 and BC3 progenies suggest there were greater 
opportunities to identify high yielding, high protein lines 
among these progenies than among the two-parent and 
BCi progenies. 

There was a strong inverse relationship between seed 
protein and seed oil among progenies of each of the 
populations evaluated (Fig. 1). This is consistent with 
reports of Burton (1984), Hartwig and Hinson (1972), 
Sebern and Lambert (1984), Simpson and Wilcox (1983), 
and Wehrmann et al. (1987). Slopes of the regression 
lines when seed protein was regressed on seed oil were 
all negative, significantly (P = 0.05) different from 0, 
and similar for the two-parent, BC2, and BC3 populations 
(Table 2). Averaged across the four populations, there 
was a 1.5 unit decrease in seed protein per unit increase 
in seed oil. This is similar to the 1.5 to 1.6% gain in 
seed protein for each 1 % decrease in seed oil calculated 
by Hanson et al. (1961) in a genetic analysis of energy 
production in the soybean. 

Deviations from regression of protein concentration 
on oil concentration were progressively smaller and /f 2 
values were progressively larger with successive back- 
crosses to Cutler 71 (Table 2). This indicates a stronger 
inverse relationship between seed protein and seed oil 
among progenies of successive backcrosses to the recur- 
rent parent. However, minimum values for seed oil 
increased among progenies of the BCi, BC2, and BC3 
populations (Fig. 1). This indicates that not only was 
the yield of the recurrent parent recovered but there was 
a tendency toward recovering the oil concentration of 
the recurrent parent as well. Maximum oil values would 
not be expected to increase in successive backcross popu- 
lations abpve =204 g kg" 1 , the average value for Cutler 
71, the recurrent parent. 
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Table 3. Approximate processed value based on chemical composi- 
tion of seed of soybean parental selections and Hamilton. Ap- 
proximate processed value based on data averaged across three 
repUcations in each of 2 yr, an oil price of $0,504 kg 1 , and 



Strain 


oakg- 1 

seed 


Protein kg* 1 
seed 


Total kg" 1 
seed 


Value ha' 1 








$ 




Pando 

CX602-107 

CX797-21 

CX1038-14 

CX1307-205" 

Cutier 71 

Hamilton 


0.06S0 
0.0767 
0.0732 
0.0750 
0.0763 
0.0895 
0.0939 


0.2026 
0.1927 
0.1936 
0.1894 
0.1918 
0.1660 
0.1634 


0.2676 
0.2694 
0.2668 
0.2644 
0.2681 
0.2555 
0.2573 


214.08 
577.83 
581.36 
667.71 
758.99 
628.79 
742.31 



The economic values of high protein strains compared 
with the values of high yielding strains with high seed oil 
concentration were compared. Approximate processed 
values, which are functions of seed protein and oil con- 
centration and the price of each component, were calcu- 
lated for parent strains, the high-protein BC 3 selection 
and Hamilton (Table 3). Cuder 71 and Hamilton, with 
oil concentrations typical of currendy grown cultivars, 
had the highest monetary values for seed oil of the lines 
evaluated. Pando, the high protein donor parent, had 
the highest monetary value for seed protein. Seed protein 
values for each of the backcross parents and CX 1 307-205 
were higher than protein values for Cuder 71 and Hamil- 
ton. This resulted in higher total approximate processed 
value for the high protein parent and selections than for 
these two cultivars. When expressed on a hectare basis, 
CX1307-205 had the highest total approximate processed 
value, higher than that of the recurrent parent or the 
cultivar Hamilton. This demonstrates that soybean strains 
with high seed protein can be developed that are as 
profitable, on a chemical constituent basis, as currendy 
grown cultivars. 

This research demonstrates that seed yield of a recur- 
rent parent can be recovered when backcrossing high 
seed protein to the recurrent parent. Selection for the 
recurrent phenotype as a secondary trait to protein in 
each backcross population was effective in the recovery 
of the Cuder 71 phenotype with only three backcrosses. 
An unexpected benefit was the trend toward recovering 
oil of the recurrent parent while maintaining high seed 
protein from the donor parent. The approximate pro- 



cessed value calculations demonstrated that the highest 
yielding, high protein BC 3 line had a higher chemical 
constituent value than a currendy grown cultivar of simi- 
lar maturity. 
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ing blocks in a 2-year period, 1956-1957, and progenies 
were grown from groups of clones selected for superiority 
of several plant characters in each type. These progenies 
were evaluated for the same characters as their parents in 
a 2-year period, 1957-1958. 

Interaction of genotype with environment was large and 
necessarily was removed from estimates of genetic variance. 
Heritability estimates from variance components and parent- 
progeny regressions indicate that there is ample additive 
genetic variance present among , clones of switchgrass to 
make additional progress by selection of clones within 
superior strains of the grass. This viewpoint is further sup- 
ported by realized heritabilities and the close agreement of 
predicted and observed gains for several characters. 

Genotypic and phenotypic correlations among the ob- 
served characters indicate that effective simultaneous selec- 
tion may be made for important characters in the. develop- 
ment of superior varieties provided that sufficiently large 
Dopulations are screened. 
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TPHE emphasis in soybean breeding has been on the pro- 
duction of high yielding, high oil lines; however, a 
hift in emphasis from oil to protein has been made to 
>btain a balanced total program. In practice, an increase in 
)il percentage has been associated with a decrease in pro- 
em percentage. The change in percent oil to the change in 
>ercent protein is considered to be approximately 1 to 2. 
The basis for this statement or, for that matter, the basic 
neasures for studying the genetic control of the energy are 
lot clearly defined. The objective of this paper is to quan- 
ify the genetics involving the production of energy by the 
oybean plant and to relate this information with the 
>resent breeding program. 

A plant produces energy by first synthesizing sugar. The 
nergy which is not used for plant metabolism and growth 
s stored. The form in which the energy is stored and the 
luality and the quantity of the stored energy is the unique 
xpression of a genotype grown within an environment, 
'ield components such as seed size, seeds per fruiting unit, 
nd units per plant are genetically controlled but must be 
onsidered subservient to the ability of a genotype to pro- 
uce energy unless storage limitations exist. Two genetic 
^sterns should be considered: (i) the genetic potential re- 
uired to produce total seed energy and (ii) the genetic 
ontrol of the distribution of energy between seed fractions. 

MATERIALS AND METHODS 

Experimental material— The data used for the genetic analysis 
ere obta ined from Leffel et al. (10). For one test environment, 

1 Joint contribution from Crops Research Division, Agricultural 
esearch Service, USDA, and the Maryland and Illinois Agricul- 
ral Experiment Stations. Publication No. 347 of the U S Re- 
onal Soybean Laboratory, Urbarta, 111. Received for publication 
me 13, I960. 

2 Geneticist, Crops Research Division, Beltsyille, Md.; Agrono- 
ist, Department of Agronomy, University of Maryland, College 
irk, Md.; and Plant Physiologist, U.S. Regional Soybean Labora- 
ry, Urbana, 111. The authors acknowledge the contributions of 
. A. Tabler, Computing Laboratory, Biometrical Services, USDA, 
?ltsvjlle, Md., to the computational aspects of this paper'. 



20 F« soybean lines were randomly selected from each of 45 F« 
families resulting from a diallel cross system. The procedure was 
repeated so that four environments were sampled. Thus, 80 F, 
Jmes were grown from each cross. The environments will be repre- 
sented as 15 25, 16 and 26 in which the digits 1 and. 2 represent 
the Maryland locations, Beltsville and Cambridge, and 5 and 6 
represent, the years, 1955 . and 1956. The experimental material was 
blocked into 20 groups and replicated twice. Each group consisted 
ot one random F e line from each cross. The.analysis for one envi- 
ronment is given in Table 1. The combined analysis is simply a 
pooled within-environment analysis since different F« lines occurred 
within each environment. Although the genotype by environment 
interactions are not available, some of the concepts to be developed 
involve the description of a relationship within an environment 
and the consistency of the description between the environments, 
would C bTheTpfuT " gCn0type by environ ment interaction 

Energy concepts—The mineral composition of the soybean seed 
was taken as 5% and assumed not to contribute significantly to 
total seed energy. Energy sources are identified as protein (Xl%) 
oi and residual (X.%). "Protein" is calculated from 

total nitrogen and therefore includes all nitrogen-containing sub- 
stances Oil is the petroleum ether-soluble materiaj and includes 
a small amount of non-triglycerides. The residual portion not 
accounted for as protein, oil, or ash (X*% ;=95 — Xa% — X,%) 
mav be considered to be mostly carbohydrate including soluble 
carbohydrates and structural material such as cellulose. This re- 
sidual fraction has been represented as sucrose with a caloric value 
ot 3.95 Kcals./gram. 

The average caloric value for soybean oil was estimated as 9 40 
Kcals./gram. An average fatty acid composition for soybean oil 
was taken as 5% linolenic, 50% linoleic, 25% oleic, and 20% 
saturated 18-carbon acids (stearic) (Howell, 6). The small 
amounts of acids of other than 18-carbon lengths were ignored 

^*J? I V e /' f * t ! 7 acid ( G8H » a °*) *** a heat of combustion of 
2634 Kcals./mole. The heats of combustion for oleic and stearic 

TABLE 1 — Analysis of variance for F a line variability within an 
environment. 



Source of variation 



Heps and Groups 
Crosses 

Crosses x Groups 
Error 



Degrees of freedom" 



39 
44 

836 

880 



Expected mean square 



*\* ""Si 

of 



-5 ■ — . ; "I 

a estimates the F, line variability within crosses and within locations and Is taken 
81 as an estimate of the additive genetic variability. • " 
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TABLE 2— Amino acid composition assumed for soybean protein 
(Block_and_Weiss,l). 



Amino add Grama In 100 g. protein 



Glycine 

Alanine 

Valine 

Leucine 

Isolenclne 

Proline 

Phenylalanine 

Tyrosine 

Aspartlc add 



3.8 
4.S 
5.2 
7.6 
5.8 
4.1 
4.8 
3.2 
8.3 



Amino acid Grams in 100 g. protein 



Glutamic add 

Methionine 

Cystine 

Lysine 

Arglnine 

HlsUdlne 

Tryptophane 

Serine 

Threonine 



18.6 
1.1 
1.2 
6.6 
7.0 
2.5 
1.2 
5.6 
3.9 



acids were taken from Kharasch (9). The values for hnolenic and 
linoleic were calculated by assuming that each additional double 
bond reduced energy by an amount equal to the difference between 
stearic and oleic. Three moles of fatty acids are esterified with one 
mole of glycerol (CaHaO,, 397 Kcals./mole) giving triglycerides 
with: an average formula of GsrHjoi.eOe and an average heat of 
combustion of 8299 Kcals./mole or 9-40 Kcals./g. oil. The small 
eneirgy change due to the formation of the three ester bonds is 
neglected. 

The average caloric value for protein was estimated as 4.57 
Kcal./gram. The amino acid composition considered for soybean 
protein was taken from Block and Weiss ( 1 ) and is summarized 
in Table 2. The heats of. combustion for the amino acids were 
obtained from Kharasch (9). The heats of combustion for amino 
acids not listed by Kharasch were estimated from the heats of 
combustion for similar compounds. The energy in the peptide 
bonds was neglected. This energy is about 3 to 4 KcalVbond, 
which is less than 1% of the energy of the amino acid in most 

Carbon equivalent — The products of the first energy-yielding 
sequence of the plant (glycolysis) are three carbon sugar deriva- 
tives. These substances may be channeled into alternate energy 
forms. An input of energy is required in the synthesis of products, 
such as oil, with energy levels higher than sugar. The input energy 
includes the additional energy now stored in the product plus 
energy expended to make the conversion. This expended energy 
can be expressed as "work" carbons or the initial sugar carbons 
required to reach the new energy level. The "work" carbon 
appears as COi and thus is lost to the plant. Carbon equivalent is 
defined as the gram atoms of sugar carbon required to produce a 
finished product including both gram atoms of "work" carbon lost 
in " the synthesis and the gram atoms of carbon stored in the seed 
fraction. This unit of measure expresses a seed fraction in terms 
of the weight of sugar carbon required to produce the fraction 
rather than weight of the fraction. 

For oil, one work" carbon disappears for each 2 carbons present 
in the fatty acid. Since there are 54 fatty acid carbons and 3 gly- 
cerol carbons in the oil formula, the ratio of "work" carbons to 
oil carbons is 27/57 = .474. The "work" carbons for protein are 
much less certain than those for oil because of the uncertainty, of 
the synthetic mechanisms. However, using the information sum- 
marized by Fruton and Simmonds (4), one can estimate the 
"work" carbon required to produce the soybean protein as a dif- 
ference between the total number of carbons in the primary mate- 
rial involved in the amino acid synthesis and the number in the 
amino acids. The "work" carbon to protein carbon ratio was esti- 
mated as .478. It is assumed that no "work" carbon, or only a 
negligible quantity, is involved in the formation of the residual 
fraction. 

The conversion of oil or protein to carbon equivalent would 
also require the factor, (carbon weight in fraction) /(formula 
weight in fraction). For oil, 57 moles of carbon = 684. Ratio 
carbon weight/formula weight = 684/883 = .775. Total oil car- 
bon per 100 grams of seed is therefore .775X». Similar calculation 
based on the amino, acid distribution from Block and Weiss. (1) 
gives a ratio of .532 for protein. If the residual fraction is repre- 
sented as glucose, a ratio of .400 is obtained for this fraction. The 
information on energy, carbon ratios, and "work" carbons is sum- 
marized in Table 3. The conversions involving energy are included 
in the table. The weight of sugar carbon (carbon equivalent) re- 
quired to produce a gram of soybean seed can be calculated directly 
from the information given. An interesting item is that about 
one-fourth of the initial sugar carbons associated with the seed 
complex is lost as "work" carbon to produce the protein and oil 
in the soybean seed. 

Relative genetic sensitivity — The transformation, energy per acre 
or the proportion of sugar carbons, requires the combination of 
two or more measurement, criteria. One needs to define a statistic 



TABLE 3— rCaloric and gram carbon conversion factors for the 
three soybean seed fractions together with a summary for con- 
versions to an energy basis. 



Seed 
fraction 



Kcals./g. 



Grams carbon 
per 100 g. seed 



Grams "work" carbon 
per lOOg. seed 



Protein 
Oil 



4.57 
9.40 
. 3.95 



.S32X, 
.775X, 
.400X, 



.478<.532Xj> 
.474(.775X,) 
0 



Energy per gram by seed fractions 

Ep/g = 4.57V100 
E^/g - 9.403^/100 
Er/g> 3.95X,/100 



Energy per acre by seed tractions . 

Ep/A = k4.57X,Yt 
Ejj/A = k9.40X.Y 

(E p + E G )/A - k(4.S7X t + 9.40X,)Y. 
E/A " k3. 95X,Y + Ep/A + Ep/A 



t X, - (95-X, -X,) X Y - YIELD/A 

which will permit the comparison of two measurement scales with 
respect to identifying true genetic differences. The problem is 
similar to the comparing of different scales of measurement (2, 3, 
11, 12) but including genetic theory. The statistic to be developed 
will be called relative genetic sensitivity. 

Consider an arbitrary scale (M) reflecting the true genetic worth 
for any character. Then a mapping function, f(gi), can be found 
to map the true genotypic scale, (gi), on M. With the assumption 
of an additive genetic model, the mapping function for any char- 
acter becomes a linear regression. The coefficients b^i for /* on gi 
and b/ii for A* on gi are denned as scale parameters rather than the 
normal regression coefficients. The scale parameter can be defined 
as bfii = W/*g% (13). Consider that one has a phenotypic obser- 
vation and wishes to make a statement concerning the true rela- 
tive genotypic value as measured on the scale M. The genetic sensi- 
tivity (A 3 ) would essentially be inversely proportional to the con- 
fidence increment in fi 9 or *i = l/&ibpi<re Xt o<? x being the error 
variance for gi. Two measures will have equal genetic sensitivity 
only if Atb/Ai<*i = Atb^** for any selected level of probability. 
Relative genetic sensitivity is defined as 

With the assumption of independence between *i and *», test 
procedures have been developed involving statistics proportioned 
to X*i and and the values necessary for tests of significance 
have been" tabulated for cases with limited degrees of freedom (2, 
12). For genetic variability studies where the degrees of freedom 
for progeny and error mean squares are usually large, an approxi- 
mate standard error for 0u can be calculated, using the technique 
given by Kendall (8) and the asymptotic normal property of sta- 
tistics estimated from large samples. An approximate standard 
error for 0ia is 

0* [(1 -P*i)/m + (1 - P*0/n,+ H(2-H)/nJ 1/2 /H, 
where m and n* are the degrees of freedom for progeny and for 
error mean squares, respectively, H is the average heritability for 
the two characters, Hi = r *gV(r *g*i -f ac M, and Pi and p. are 
correlations involving the phenotypes and the error deviates, 
respectively* The correlations result from the condition that the 
two phenotypes were measured on the same individual. Hence, \ 
and a, are not independent. If the data were obtained from two 
different sources, Pi = p% = 0, and m and n* would be the har- 
monic means of the degrees of. freedom for progenies and for 
error mean squares, respectively. 

The definition of relative genetic sensitivity is an attempt to 
combine the concepts of measurement sensitivity and genetic theory. 
To have merit, the statistic must not only make biological sense 
but also be simple to use and easy to visualize. The statistic meets 
these requirements. If two measures are equally sensitive in identi- 
fying genetic differences, <f>vt = 1.0. One is thus interested in test- 
ing the deviation of an estimated <Pvt from 1.0. ere*, is the measure 
of the reproducibility of a genotype within a population. Although 
for this study the population is within an environment, both cr^i 
and H can be defined for environments and involve the genotype 
by environment interactions. The value 0u implies only that two 
measures have been equally sampled. lOO^u would be the number 
of environments (or replications) needed for measure criterion 2, 
expressed as a percent of the number of environments (or replica- 
tions) considered for measure criterion 1, to obtain equal sensi- 
tivity in the two measurements for detecting genetic differences. 

"Two reasonable approximations were made to obtain a usable 
formula: (i) V(ag t ,)/((Tg» l )» ^ 2(l/m + l/n^/H', and (ii) 
Hi and H* were replaced by H, the average of Hi and Hi. 
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RESULTS 

The individual year-location analyses as outlined in Table 
were pooled for the four environments, giving 3,344 
legrees of freedom for progeny variability within crosses 
vithin environments and 3,520 degrees of freedom for a 
>ooled error. There would be no reason to analyze the four 
nvironments separately unless conflicting results were 
►btained. Actually, the results found for the different envi- 
onments were surprisingly similar. Unless otherwise stated, 
he analyses will be based upon the pooled analysis. 4 

The primary measures available are yield, percent pro- 
-in, and percent oil. The protein and oil percentages deter- 
line percent residual. The F ratios and the variance com- 
onents for these four measurement criteria are summarized 
i Table 4. The ten parents used in the diallel cross system 
'ere selected for their oil potentials and for diversity of 
enetic backgrounds. The variability found in this study is 
laractenstic of the F 3 line variability of soybeans. Real 
enetic variability for residual apparently exists. However, 
*netic variability for percent protein and oil implies 
enetic variability for residual due to an artifact created by 
le measurement criteria. 

Statistical estimation of energy per gram— Consider the 
>ngenetic (error) deviates within a location. A scale 
irameter of oil to protein would be ac^^. On some 
lergy measurement scale, the scale parameter between two 
ed fractions would be 1.0. Thus, K 21 ae^e, = 1.0, and 
21 — CTe 1 /<rc 2 . The error variability affords an estimate of 
e relative energy scale for two seed fractions. Of equal 
terest would be the relative energy scale based upon geno- 
pic levels. The results are tabulated in Table 5. The 2:1 
ange for protein to oil as determined from the experi- 
mental errors (<y €l /<rc 2 ) is typical of the ratios calculated 
Dm other soybean data, and the energy value for oil 
ould be about twice that for protein, which agrees with 
e ratio of caloric values for the two fractions. Since the 
"dual is taken as the fraction of seed not accounted for 

the two fractions estimated chemically, it must be con- 
ered as a reflection of the two fractions and, thus, diffi- 
lt to interpret. The energy per gram for residual (3.95 
:aIs /g) could be underestimated. That is, if 4.9 Kcals./g 
considered for residual, the ratios based upon error vari- 
ety and upon caloric values per gram are very similar, 
e 1.3 to 1 estimate for the change of protein to oil as 
rermined from the genetic components (vgjvgz) does 
i fit into the caloric value concept. This deviation from 
1:1 change on the genetic scale is found in other data, 
r example, the pooled estimate for the data presented by 
inson et al. (.7) gave 1.6:1. Certainly, an interpretation 
the energy distribution between the three seed fractions 
1 require more than an energy analysis based on the 
pective caloric values. 

Energy per acre— The transformations required for the 
isurements of energy per acre in protein, oil, protein 
s oil, and total are denned in Table 3. The energy-per- 
? measures involve the products of two measures which 
subject to error or the sum of two or more such prod- 
Yield (bu./A.) is the normal measure taken to repre- 
t an approximate measure of energy production and will 
ased as the reference measure. The relative genetic sensi- 
ties of the energy measures as compared with yield are 

Appendix tables giving detailed analyses for the data presented 
his paper can be obtained upon request from Soybean Investi- 
>ns, Crops Research Division, Beltsville, Md. 



TABLE 4 — Statistical description of the F a soybean line variation 
within crosses within environments for the primary measures 
considered. 



Statistic 



Character 



Protein (%) 



F ratio 
Si 



Oil (%) 



Residual (%) Yield (bu. /X. ) 



2.08 
0.3926 



3. 74 
0. 2488 



1.96 
0,3017 



1.62 
5. 379 



17.243 



TABLE 5 — Statistical estimates of relative scale changes for the 
three- soybean seed fractions. 



Measurement 
scale 



O/P 



R/P 



Error 

GeneUc 

Energy 



2. 00 
1.26 
2. 06 



R/O 



1.08 
1.14 
0.86 



.54 
.91 
.42 



TABLE 6— Relative genetic sensitivities with approximate stand- 
ard errors for the energy measures as compared with yield and 
the variability in energy associated with yield. 



Statistic 



Character 



Ep/A 



Eo/A 



Relative genetic sensitivity 
*ly 1.01±.037 ; 99±. 037 .98*. 035 

Relative proportion of variability In E associated with yield 
Phenotyplc 0.96 . 94 ~99 < -i 

Genotyplc 0.95 .89 98 



E/A 



.99 ±.034 



given in Table 6. As previously stated, <j> ly would be a 
measure of the ability to identify genetic differences on an 
energy scale as compared with the ability to identify genetic 
differences on the yield scale. The energy measures are 
essentially as efficient a selection criterion as yield. The 
sensitivity ratios are based upon wi thin-environment vari- 
ability. Actually, both percent oil and percent protein give 
fairly consistent reactions for environments. If the relative 
sensitivity ratios were based on the sampling of two or 
more environments, one might expect the ratios to be 
greater than 1.0. 

The proportions of the phenotypic and the genotypic 
variability in the energy measures explained by phenotypic 
and genotypic variability in yield are included in Table 6. 
The selection for yield is essentially identical with the 
selection for total energy produced per acre. The results, 
however, would have been anticipated since the residual 
fraction is determined as a difference (5% ash assumed) 
and the seed fractions tend to vary proportionally to energy 
per gram. One would also conclude that the selection for 
genetic differences in yield is for all practical purposes the 
same as selection for the genetic capacity to produce energy 
protein plus energy oil per acre. This analogy does not fol- 
low for the selection for energy protein or energy oil per 

Distribution of energy between seed fractions— The 
genetic correlations between the three seed fractions based 
oh percent by weight and caloric energy per gram must be 
identical since the two sets of measures differ only by con- 
stants: The interpretation of the correlations is difficult 
since the base units of measure are not equivalent with re- 
spect to the energy required to produce the fractions. There- 
fore, the carbon equivalent unit was defined (Table 3) 
which expresses each seed fraction in terms of the grams of 
initial sugar carbon required to produce the respective 
fractions in a gram of soybean seed. Only the sugar carbon 
which becomes associated with the seed and the "work" 
carbons lost in the process of changing the simple sugars 
to an alternate energy form in the seed are considered. The 
carbon utilized for growth and metabolism of the plant 
parts other than seed is not considered. 
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The total weight of sugar carbon associated with. 100 
grams of seed would be 

t| = [.532 + (.478) (.532)]X 1 + [.775 
+ (-474)(.775)]X 3 +.400X 8 
for an observed seed composition (Table 3). Thus, the pro- 
portion of sugar carbon ultimately associated with the pro- 
tein fraction and the energy required to produce the protein 
would be 

Px = [532 + (.478)(.532)]X 1 /T i . 
Similarly, 

p 2 = [.775 + (.474)(.775]X 2 /T i ,and . 
p, = .400X,/T lf 

where p 2 and p 3 are the proportions of the original hexose 
carbons associated with the oil and the residual complexes 
respectively- r 

The proportion of the carbons exclusive of the "work" 
carbons which is associated with protein is p^l.478 and 
with oil is pa/1.474. Thus, correlations involving the pro- 
portion of sugar carbons associated only with the seed frac- 
tions or with the fraction plus "work" carbons are identical. 

The average protein and oil based on percent by weight 
and on the proportionate measures are given in Table 7 for 
each environment sampled. Environment 15 would be con- 
sidered an environment resulting in a moderately high level 
of protein while environment 26 would be considered an 
environment resulting in extremely high oil. The transfor- 
mation, thus, does not remove environmental effects. The 
genetic correlations between protein and oil for the two 
measurement criteria are also included in Table 7. In gen- 
eral, the level of protein and oil may vary considerably 
between environments, but the relative performance be- 
tween genotypes tends to be consistent. The genetic corre- 
lation for protein-oil percent reported by Johnson et al. (7) 
was ; — .63 for two crosses and two sampled environments. 
Although the observed correlation (—.54) is based on 
within-environment variability, it is also based on the vari- 
ability within 45 crosses. 

The phenotypic and genotypic correlations together with 
the correlations based on error deviates are listed in Table 
8 for the percent by weight and the proportionate measures 
An increase in the correlations involving protein and oil 
and a decrease in the correlations involving protein and 
residual would have been expected on the basis of the modi- 
fications in scale introduced by the p 4 transformation. How- 
ever, the magnitude of the change (—.543 to — 852) in 
the genetic correlation of protein and oil was not antici- 
pated. If the variation in Pl , p 2 , and p 3 is proportionate to 
that found in a multinomial, then the correlations resulting 
from the restriction, ( Pl + p. + p 3 ) = Xj would ^ _ y * 
-.39, and -.35, respectively, for P-O, P-R, and O-R. 
When considering the degrees of freedom available for the 
variance estimates or the consistency of the correlations in 
lable 8, one must conclude that the deviation observed for 
the genetic correlation (-.852) for protein and oil from 
—.70 (or —.77 based on error deviates) is real. In any 
event, the genetic controls of the energy distribution into 
the protein and the oil seed fractions are two very closely 
interrelated mechanisms. The observation is reasonable since 
a carbon can be associated with only one of the seed 
fractions. 

The errors for protein and for oil are independent while 
the residual fraction is estimated by difference. The negative 
correction between protein and residual and the positive 
correlation between oil and residual based upon error devi- 



TABLE 7 Average protein and oil levels and genetic con 
tions between protein and oil within environments as measi 
on the percent (X.) and prop ortionate <p,> scales 

Location- year 



Protein 



Genetic correlal 



15 
25 
16 
26 

Pooled 



42.8 
41.9 
41.1 
40.5 
41.6 



X,(%) 



.478 
.469 
.459 
.447 
.463 



21.3 
21.7 
22.3 
23.8 
22.2 



.346 
.352 
.362 
.381 
.360 



PlP 



-.421 
-.525 
-.407 
-.759 
-.543 



-.8* 
-.61 

-.9: 
-.st 



n^r™^ ° ty u pIC ^ e ?° typ i c » error correlations for 
percent by weight (X.) and the proportionate (p.) sa 
based upon a pooled analysis. VP ' 



Variables 




Correlations 




Phenotypic 


Oenoryplc 


En 


Protein-Oil 








Percent (Xj) 
Proportionate (pj) 


-.474 
-.816 


-.543 
-.852 . 




Protein-Residual 








Percent (Xj) 
Proportionate (pj) 


756 
-.465 


-.647 
-J 289 


-.6 


Oil-Residual 








Percent (Xj) 
Proportionate (p ? ) 


218 
-.132 


-.288 
-. 255 


7.1 

.+:o 



T ^S^ Rclati 7 .P netic sensitivi "'« with apprbxim; 
standard error of the proportionate measure (p5 as co 
£red with the percent measure (X.) and the variability 
the p! ass ociated with X t . y 

Statistic " ! 



Character 



Protein 



OH 



RelaUve genetic sensitivity 

0P * X * 1.14*. 034 .97*. 023 

Relative proportion of variability in p, associated with Xi 

Phenotypic 924 » 

Genot rp'° ,914 :SS 



Residual 



1.02*^03 



T ^^J°Z Ph ^° tyP l Ct 8 enot yP ic > and error correlations i 
voiying the ratio pi/p 3 and the proportionate residual (t^) 
estimated fo r each environment. vw 

Environment " " 



15 
25 
16 
26 

Pooled 



Phenotypic 



-.122 
-.132 
-.380 
-.466 
-. 253 



Correlations 
Genotypic 



Em 



+.208 
-.037 
-.011 
-.499 
-.054 



-.51 
-.40 
7.75 
-.46 
-.54 



ates would be expected since the laboratory component fc 
percent protein determination is about twice that for o; 
(unpublished data). Since an independent set of measure 
ments was made for each replication, the genetic correla 
tions involving the residual fraction snould not have th 
restrictions inherent for the correlations based on the erro 
deviates. 

/v C v P iu er Pl measures Native to the original measure 
(X,). The relative genetic sensitivities for each proportion 
ate measure as compared with the corresponding measun 
by percent weight are given in Table 9. Genetic variability 
tor X and X 2 implies genetic variability for X 3 ; however 
Pi and p 2 can be varied without necessarily varying p 
Soybean genotypes differ in abilities to produce high enenn 
products, and these differences can be identified with about 
the same precision as that available for X 3 . The p measure 
is superior to X x as a criterion for selecting high protein 
(relative genetic sensitivity of 1.14). Further the p 
measurement scale is measuring modified genetic effects as 
compared with X, measurement scale as indicated by the 
proportion of the genetic variability in Pl explained by X 
In the redistribution of genetic variability through the L 
transformation the Pl measure apparently contains genetic 
effects formerly associated with the residual (and possibly 
error) as measured on the X t scale. 

Finally, with the p, transformation one can .test indirectly 
lor the association of the genetic mechanisms involved in 
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the synthesis of protein (or oil) and the synthesis of 
residual. The correlations between the ratio p,/p 2 and p g 
as measured at the phenotypic, genotypic, and error levels 
are given in Table 10. The data are summarized by envi- 
ronments. Since the residual is determined as a difference, 
the negative correlations involving error could reflect the 
differential variability of the protein and the oil determi- 
nations. The average genetic correlation of approximately 
zero indicates that on the average the sugar carbons are dis- 
tributed to protein and oil in the same proportions in geno- 
types with a high residual as in genotypes with a low 
residual. However, this genotypic expression apparently is 
not a consistent feature, for the environments sampled. For 
the two environments (25 and 16) which have protein and 
oil levels characteristic for the populations studied, the 
genetic, correlations are essentially zero; however, for the 
environment with the moderately high protein (15) the 
genetic correlation is positive while for the environment 
with the extremely high oil level (26) the genetic correla- 
tion is negative. On the surface these correlations appear 
contradictory. The ramifications for these correlations will 
be considered in the Discussion. 

DISCUSSION 

The ideal measurement scale would be one which 
directly reflects the primary gene effects. Total seed energy 
reflects the ability of a genotype to synthesize energy. A 
soybean genotype apparently has the capacity to balance 
growth progressively with its yield potential by continued 
podding or flower dropping. At maturity the seed energy 
is stored with a seed size and an energy distribution be- 
tween seed fractions characteristic of the genotype. There- 
fore, energy produced by a genotype as measured by some 
function of yield is considered to be a more realistic 
measure <of gene effects than any component approach in- 
volving seed size, seeds per fruiting body, fruiting bodies 
per plant, etc., which involve measures incidental to the 
basic genetic problem. Grafius' statement (5) "Yield is 
thus an artifact" is a meaningless statement referenced to a 
geometric interpretation which is incidental to the basic 
genetic problem. 

The assumption has been made that the composition of 
atty acids in soybean oil and of amino acids in soybean 
protein is constant over genotypes, or that the description 
or the average seed composition approximates specific f rac- 
tons within practical limits. With more complete biochem- 
cal information, a more exact p, transformation could be 
ieflned. A realistic and acceptable transformation must 
jave been developed to obtain the correlations reported in 
his study. r 

Distribution of sugar carbons between seed fractions— 
The genetic capacity of a plant to produce total seed energy 
5 limited by its genetic capacity to produce sugar energy. 
Vith the carbon equivalent concept one expresses each seed 
raction in terms of an equivalent base measure, the grams 
f sugar carbon required to produce a fraction including 
work carbon. Energy associated with plant growth and 
letabohsrn is not considered. Let us start with the premise 
iat the initial sugar carbons are not predestined or are not 
isociated with a particular ultimate energy fraction.. That 
a genetic potential to produce sugar energy can be 
isualized. 

The residual comprises a structural fraction (primarily 
illulose) and a soluble carbohydrate fraction. The struc- 
iral fraction would be considered essentially a constant 



feature, at least for the improved soybean genotypes. The 
soluble carbohydrate fraction would represent a reservoir. 
Relative to the genetic variability found in the protein and 
oil fractions, the genetic variability for the residual is re- 
strictive (Table 8). With complete restriction in genetic 
variability for residual the genetic correlation between the 
proportions of sugar carbon being associated with the pro- 
tein and oil complexes (p 1 and p 2 ) would be —1.0. The 
high genetic correlation between p 2 and p 2 is a direct 
expression of the restricted genetic variability in the residual 
fraction. Similar information is available from an examina- 
tion of the genetic components of variation. The relative 
genetic variances for the p l9 p 2 , and p 3 measurement scales 
are 1.00, .98, and .29, respectively, as compared with 1.00, 
.93, and .58, respectively, for the multinomial. 

Apparently, through selection, genotypes with a mini- 
mum cellulose structure have been evolved. Although selec- 
tion can be made for the efficiency of a genotype to produce 
high energy forms, the progress would be restricted. Thus, 
sugar carbons tend to become associated with one of the 
two pathways of synthesis for protein or oil, and the 
protein-oil synthesis could be considered as a competitive 
balance for the sugar carbons between the genes for protein 
and for oil. Hence, a population may be genetically homo- 
zygous for protein factors but segregating for oil factors, 
yet the relative genetic variability for both protein and oil 
would be similar to a population segregating for both pro- 
tein and oil genes. Since genetic variability exists for oil 
within the high oil lines, one can select for high protein 
from high oil lines. However, as one eliminates the factors 
for high oil, he probably will be . faced with inefficient 
genotypes with respect to conversion of residual to high 
energy forms. 

On the surface the signs of the genetic correlations as 
given in Table 10 appear to be reversed for locations 15 
and 26. The postulation has been made that the protein 
and the oil synthesis can be conceived as an association of 
sugar carbons with alternative energy forms and a com- 
petitive balance for the sugar carbons between the genetic 
structure for protein and for oil. In an environment con- 
ducive to a high protein level, environmental factors could 
favor protein synthesis and tip the competitive balance in 
favor of protein factors. The synthesis for oil in the high 
oil genotypes would draw the energy in part from the 
soluble carbohydrates in the residual giving a positive 
genetic correlation between p x /p 2 and p 3 . In environments 
conducive to a high oil level the converse would follow 
giving a negative genetic correlation between pjp 2 arid p 3 . 

Relative scale change between seed fractions — Attention 
has been given (Table 5) to the relative scale change for 
the seed fractions and the similarity of the estimated scales 
to the caloric values for each seed fraction. However, dis- 
crepancies were noted when the measurements were con- 
sidered at the genotypic level. Let us reexamine the data 
in light of the concept of carbon equivalents. The weight 
of sugar carbon used to produce a gram of soybean seed is 
[(.7863)X 1 + (1.1424)X 2 + (.4) (95~X 1 -X 2 )] /100 
(Table 3). With a constant genotypic capacity to produce 
sugar, a change in oil (AX 2 ) reflects directly the change in 
protein, AX, ■= — .7424 AX^. 3863. Therefore, the scale 
change of protein to oil on the error level would be 
AX 1 /aX 2 '.= —1.92, . approximately. The scale change of 
protein to oil between genotypes would invplve different 
genetic capacities to produce energy and relative genetic 
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variability for the residual fraction. With no genetic vari- 
ability in residual (p 3 )> then Ap 2 ~ — Ap 1? and aXj/AXjj 
= — 1.1424/.7863 '= —1.45 as measured on the X t scale. 
Thus, in the selected high oil lines one would expect the 
scale change of perhaps 1.5-1.6 for X x to X 2 as measured 
on the genotypic scale. The breeder will probably find that 
a 2% gain in protein will not be realized for each 1% loss 
in oil when converting to high protein lines. 

However, with the assumption that the change in protein 
to oil percentages averages 1.92:1, the gram carbon of 
sugars required to produce one gram of soybean seed 
(Table 3) is essentially constant for a normal range of 
protein and oil percentages. For an average protein to oil 
change of less than 1.92:1, the percent protein and grams 
of seed produced for a constant amount of sugar energy are 
positively correlated. Since approximately .40, .79, and 1.14 
grams of sugar carbon are required to produce a gram of 
soybean residual, protein, and oil, respectively, increased 
protein or residual at the expense of oil implies higher 
yield for a constant energy potential. 

The negative correlations reported by Johnson et al. (7) 
between yield and percent protein could simply be a mani- 
festation of these results. Within an environment conducive 
to high oil (say limiting nitrogen supply) the high pro- 
tein genotypes tend to be associated with a low residual. 
A reduced residual implies a reduced yield for a constant 
initial energy. These results were also observed in the 
present study where the genetic correlations between yield 
and percent protein were estimated to be -f- .142, +.002, 
—.060, and —.203 for environments 15, 25, 16, and 26, 
respectively. These correlations are to be compared with the 
correlations in Table 10. 

Ramifications for selection — The transformations consid- 
ered in this paper were made so that the measurement 
scale would represent more directly gene effects and an 
interpretation could be made of the genetic mechanisms 
involving the production of the protein, oil, and residual 
fractions. In a practical breeding program for high protein 
the measure percent protein (X t ) still has merit as a selec- 
tion criterion. Selection pressures on percent protein would 
exert a greater pressure for reducing the residual fraction 
than selection pressures on the proportionate (p x ) scale 
(Table 8), principally due to the scale parameters for the 
two measures. On the other hand, if one were to develop 
a feed bean high in protein under an economic structure 
where the oil had a low economic value, the proportionate 
measurement scale would be recommended since p x more 
directly measures the genetic mechanisms for protein syn- 
thesis, the p x measure has a significantly greater relative 
genetic sensitivity than percent protein, and the residual has 
food value for feed purposes. The combination must be 
associated with high yield unless there are physiological 
restrictions, such as inadequate nitrogen during the critical 
period of conversion of protein precursors to seed protein. 

SUMMARY 

The caloric values and the weight of initial sugar carbons 
required to produce a fraction were developed for each' of 
the soybean seed fractions. The caloric measurement scale 
led to agronomic evaluations but was considered inadequate 
for a complete genetic analysis. The latter measure per- 
mitted a transformation (pi) expressing the proportion of 
the initial sugar carbons associated with each seed fraction. 
With the assumption that a genotype has unique capacity to 



produce initial sugar energy, interpretable concepts of pro- 
tein, oil, and residual synthesis resulted. 

A statistic, relative genetic sensitivity, was developed to 
compare two measurement scales as selection criteria. 

Energy per acre based on the seed fractions or upon total 
energy was essentially as efficient a selection criterion as 
yield. The selection for high yield was essentially identical 
genetically with the selection for total energy per acre. 

Genotypes differ in the ability to convert the residual 
fraction to oil or protein. Within highly selected lines, 
however, the genetic variability in the residual was found 
to be restricted as compared with the genetic variability for 
oil and protein. Thus, the genetic correlation between pro- 
tein and oil was high (—.852) as measured on the pro- 
portionate (p t ) scale. 

The scale change for protein to oil percentages was cal- 
culated as 1.92 for random deviates and between 1.45 and 

I. 92 for changes on the genetic scale, based on the concept 
of carbon equivalents. The gain in percent protein for each 
loss of 1% oil in a breeding program for high protein 
would be expected to be less than 2%, perhaps in the range 
of 1.5-1.6%, but for a constant energy potential yield 
should be increased. Experimental estimates for the scale 
change were comparable to the theoretical values. 

The average genetic correlation between percent protein 
and yield estimated from the data was negative but essen- 
tially zero, although this may vary with environments. 

One would expect the genetic correlations between per- 
cent protein and yield to be small and positive. Since 
approximately .79, 1.14, and .40 grams of sugar carbon 
were required to produce one gram of protein, oil, and 
residual, respectively, there should be no restrictions in pro- 
ducing a high yielding, low oil bean and, if nitrogen is not 
limiting or other physiological restrictions are not present, 
a high yielding, high protein bean. 
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ABSTRACT 

The purpose of this study was to investigate the effectiveness 
of selection based upon selection indexes to modify protein con- 
centration of soybean [Glycine max (L.) Merr.] seeds. Selection 
criteria were compared by evaluating F 4 performance • of ¥ r 
derived lines identified in the F, as being in the upper 10% of 
the population. Smith-Hazel indexes designed to maximize the 
rain in protein by including oil or oil and sugar as secondary 
traits were not more effective than direct selection, which re- 
sulted in changes of 1.42% protein and - 1.14% oil. Smith-Hazel 
indexes which gave protein and oil equal economic weights, either 
including or not including sugar as a secondary trait, and a 
desired gains index with desired gains set at 1.0% protein and 
0.25% oil were all similar in results to selection based on the 
sum of protein + oil. Among these methods, changes ranged 
from 0.15 to 0.31% for protein and from -0.01 to 0.19% for oil. 
An index designed to change protein by 1.25% and oil by -0.75% 
produced observed changes of 1.31 and -0.81%. An index de- 
signed to increase protein as much as possible while holding oil 
constant resulted in changes of 0.45% protein and -0.13% oil. 
This index was more effective than selection for protein follow- 
ing culling oil at the mean, which resulted in little change for 
either trait 

Additional index words: Glycine max (L.) Merr., Oil, Sugar. 

Soybean [Glycine max (L.) Merr.] occupies an im- 
portant place in the world market because of its 
oil and protein content. Typically, soybean cultiyars 
grown in the the USA contain about 41% protein, 
21% oil, and 11% soluble carbohydrates. Both oil and 
protein are highly heritable. Based upon information 
from several soybean researchers, Johnson and Ber- 
nard (1963) presented estimates of expected herita- 
bilities of 0.63 for protein and 0.67 for oil when se- 
lection was based on means of two replications of F 5 
or later generation lines grown at two locations. They 
estimated the expected genetic correlation between 
protein and oil to be -0.60. Thus, selection to in- 
crease either oil or protein would be expected to be 
effective but would be expected to lead to a decrease 
in the other. Although the contribution of protein 
to the value of soybeans has increased, at present it 
would not be practical to increase protein at the ex- 
pense of oil. Thorne and Fehr (1970) proposed that 
selection for the sum of protein + oil might increase 
protein without a marked decrease in oil. This 
amounts to selection based on an index in which each 
trait is given equal weight. 

Smith (1936) proposed an index of the form 1 — 
ZbiXj where b i9 the weight given to trait X i9 is derived 
such that the index should best predict the true worth 



of a line. Hazel (1943) extended the concept by con- 
sidering only the additive effects of genes in regard 
to the genetic variances and covariances used in the 
calculations. Hazel and Lush (1942) and Young (1961) 
showed that selection based on such an index was 
theoretically never less efficient than selection based 
on independent culling levels or than the use of tan- 
dem selection for each trait. 

Selection based on the Smith-Hazel type of index 
leads to changes in the breeding values of individual 
traits. Kempthorrie and Nordskog (1959) developed 
a restricted selection index for situations in which it 
is desired to increase the aggregate genetic value as 
much as possible while holding the values of some 
traits constant. . 

Both of the above indexes require the assigning ot 
relative economic weights to each trait. This enables 
secondary traits to be used to increase the progress 
made with primary traits (by assigning economic val- 
ues of zero to the secondary traits) but limits the 
usefulness of such indexes to those cases where the 
breeder is able to designate appropriate economic 
values. Pesek and Baker (1969) presented a desired 
gains index which avoids the need for economic 
weights provided the breeder specifies the genetic 
gain desired for each trait. 

A major handicap to the effective use of all of the 
above indexes is that the variances and covariances 
needed for their construction are subject to large 
sampling errors. Brim et al. (1959) suggested that an 
index which weighted each character only by its rel- 
ative worth would circumvent the problem, but they 
did not speculate on the effectiveness of such an in- 
dex. Selection based on the sum of oil + protein 
would be such a base index (Williams, 1 962), in which 
each trait is assigned equal economic value. 

The purpose of this study was to investigate the 
effectiveness of selection based on indexes designed 
to increase the protein concentration of soybean. Ge- 
netic correlations between sugar and protein were 
reported as -0.68 and -0.60 in two populations 
studied by Openshaw and Hadley (1981). Thus, we 
used sugar and oil as secondary traits to attempt to 
increase the gain in protein resulting from selection. 

1 Contribution from the Dep. of Agronomy, Univ. of Illinois at 
Urbana-Champaign, Urbana, IL 61801. Supported by funds from 
the Illinois Agric. Exp. Stn. Received 17 May 1982. 

1 Former graduate research assistant (present address: Dfc- 
KALB-PF1ZER GENETICS, 3100 Sycamore Road, DeKalb, IL 
601 15) and professor of plant genetics, respectively. 
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Restricted and desired gains indexes and a base index 
composed of the sum of oil + protein were compared 
for their effectiveness in increasing protein while 
controlling the corresponding change in oil. 

MATERIALS AND METHODS 

Two populations were used in this study. Population I 
was derived from the cross HHP X 'Mandell' and Popu- 
lation II from Trotana' X 'Williams'. HHP is a high-protein 
experimental line developed by H. H. Hadley by intro- 
gressing germplasm from Glycine soja Sieb. and Zucc. into 
that of G. max. AH experiments were grown on the Agron- 
omy South Farm of the Univ. of Illinois, Urbana, III. About 
200 F 2 plants in 1976, 150 F 5 lines in 1977, and 150 F 4 
lines in 1978 were grown from Population I. About 500 
F, plants in 1977 and 300 F 3 lines in 1978 were grown 
from Population II. 

F 2 plants were grown in single-plant hills spaced 30 X 
38 cm apart. One hundred fifty F 2 plants from Population 
I and 300 from Population II were chosen on the basis of 
sufficient seed. F s progenies were planted in single-row plots 
2.5 m long, spaced 0.76 m apart, and were seeded at a rate 
of 40 seeds per row. Populations were divided into blocks 
consisting of 30 F 9 lines each and planted as a blocks-in- 
replications design with two replications. Each block also 
contained both parental lines. For Population I, an F 2 -de- 
rived line in the F 4 was grown from bulked seed from F s 
plants of each F 2 -derived family. These lines were planted 
in the same experimental design as were F 9 *s, with lines 
being rerandomized within blocks. In F 9 and F 4 lines, the 
middle 1 .8 m of row was harvested and seeds were analyzed 
for protein, oil, and sugar percentage. 

Values for protein and oil were obtained using a near- 
infrared reflectance grain analyzer (Hymowitz et al., 1974). 
Values for sugar were obtained using a phenol-sulfuric acid - 
method described by Openshaw and Hadley (1981). 

Estimates of variances and covariances were obtained by 
equating observed mean-squares and cross-products from 
the analysis of variance or covariance with their expecta- 
tions and solving for desired variance (covariance) com- 
ponents. The genetic variance among F 2 -derived lines is 
composed of the additive plus one-fourth of the dominance 
variance in the F 3 and of the additive plus 1/16 of the 
dominance variance in the F 4 . Studies have shown that ad- 
ditive genetic variance is the predominant component of 
genetic variance for characters of economic importance in 
soybeans (Brim and Cockerham, 1 96 1 ; Hanson et al., 1 967; 
Croissant and Torrie, 1971). 

Index weights for the Smith-Hazel indexes designed to 
maximize gain were calculated by solving the equations b 
= P~'Ga where P" 1 is the inverse of the phenotypic var- 
iance-covariance matrix, G is the genotypic variance-co- 
variance matrix, and a is the vector of relative economic 
values. In this study, each primary trait was assigned a value 
of 1 and each secondary trait a value of 0. 

Index weights designed to achieve desired gains were 
calculated by solving the equations b = G~ l h where G" 1 is 
the inverse of the genotypic variance-covariance matrix and 
h is the vector of desired gains (Pesek and Baker, 1969). 
Index weights designed to improve protein as much as pos- 
sible while holding oil constant were calculated from the 
above equation by setting the desired gain of oil at 0. Rel- 
ative index weights so obtained are equivalent to those ob- 
tained with the formula given by Kempthorne and Nord- 
skog(1959). 

Predicted gains for individual traits were calculated for 
the Smith-Hazel index as G, = ZbjGjjkA, where G t is the 



predicted gain for the ith trait, bj is the index weight of the 
jth trait, Gu is the genotypic covariance of the ith and jth 
trait, k is the standardized selection differentia] (1.74 for 
Population I and 1.76 for Population II), and cr 5 , = b'Pb 
where b' is the transpose of the vector of index weights. 

Predicted gains resulting from direct selection based on 
two replications at one location were calculated as G = 
hVpk where h* is the heritability of the trait under selection 
as estimated from variance components derived from the 
analysis of variance, <r is the phenotypic standard deviation, 
and k is the standardized selection differential. Predicted 
correlated responses in trait j resulting from selection for 
trait i were calculated as Gj - (h J *h/V /1 r<-ff Pj k where 
and hj* are heritability estimates for the Un and jth trait, 
r^ is the genotypic correlation between traits i and j, es- 
timated from the analysis of covariance, <r Pj is the pheno- 
typic standard deviation of the jth trait, and k is the stand- 
ardized selection differential. Predicted responses were 
adjusted for differences in phenotypic variance between 
years. 

In all selection experiments, the upper three lines were 
identified in each block for a proportion selected of 10%. 
Response to selection was calculated as the difference of 
the mean values for the progenies of selected lines from 
the population mean. 

RESULTS AND DISCUSSION 

Differences between the parents of Population I 
for percentage protein ana oil were considerably 
greater than those between the parents of Population 
II while differences in percentage sugar seemed sim- 
ilar for the two sets of parents (Table 1). These par- 
ents were chosen with the thought that heritabihties 
for protein and for oil would be high in Population 
I and intermediate or low in Population II. 

Estimates of heritability and phenotypic and gen- 
otypic correlations were calculated from the analysis 
of variance and covariance of F 3 generations (Table 
2). Because the trials were grown in only one envi- 
ronment, these estimates could be inflated by geno- 
type X environment interactions, which are fre- 
quently not important for these traits. As expected, 
genetic variances were greater in Population I than 
in Population II for protein (1.47 vs. 0.33), oil (1.20 
vs. 0.35) and the sum of protein + oil (0.47 vs. 0.22). 
As a result, heritabihties for these traits were greater 
in Population I although they were also high in Pop- 
ulation II. Heritabihties for sugar were similar in the 
two populations. 

Genetic correlations were similar to phenotypic 
correlations in both populations. Protein was nega- 



Table 1. Chemical composition of seeds of parental genotypes, t 



Parental genotype 


Year 
grown 


Protein 


Oil 


Sugar 








% 




HHP 


1977 


49.1 


13.9 


7.7 


Mandell 


1977 


46.4 


18.7 


8.5 


Standard error 




0.34 


0.24 


0.17 


HHP 


1978 


49.6 


13.8 


7.9 


Mandell 


1978 


46.2 


19.1 


8.4 


Standard error 




0.29 


0.29 


0.20 


Pro tana 


1978 


45.9 


19.6 


9.0 


Williams 


1978 


45.2 


20.8 


8.6 


Standard error 




0.27 


0.31 


0.17 



t Mean of three replications of plots. 
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tively correlated with oil and with sugar, being most 
strongly so in Population I. The positive correlation 
between oil and sugar was also strongest in Popula- 
tion I. In populations studied by Thorne and Fehr 
(1970) and Miller and Fehr (1979) correlations of 
protein + oil with protein were positive and higher 
than those found in this study while those with oil 
were very low and tended to be negative. The cor- 
relation of protein + oil with oil was low and positive 
in Population I and was slightly larger than that with 
protein in Population II. 

The effectiveness of selecting for the sum of pro- 
tein + oil + sugar was investigated in order to see 
if the proportion of the seed containing these major 
storage nutrients could be increased. Genetic vari- 
ances for the sum of the three mentioned compo- 
nents were estimated in the F 5 as 0.38 for Population 
1 and 0.19 for Population II with corresponding her- 
itabilities of 0.75 ± 0.04 and 0.59 ± 0.05. Using the 
sum of protein + oil + sugar as a selection criteria 
in the F 3 of Population I resulted in a gain of 0.48 
percentage points in the F 4 (significantly different? 
from zero at the 0.01 level). This response was com- 
posed of a decrease of 0.13% protein (significantly 
less than its predicted value at the 0.01 level) and 
increases of 0.47% oil and 0.14% sugar (both not 
significantly different from predicted values at the 
0.05 level). These results show that it is possible to 
increase the proportion of the seed consisting of the 
three major storage nutrients. 

Three selection criteria designed to increase per- 
centage protein were compared for effectiveness in 
the F 3 : 1) direct selection tor percentage protein, 2) 
Index- 1 which included protein as the primary trait 
and oil as a secondary trait, and 3) Index-2 which 
included protein as the primarv trait and both oil and 
sugar as secondary traits (Taole 3). In formulating 
the indexes, the economic weights of secondary traits 
were given values of zero. These indexes were de- 
signed to take advantage of the genetic correlations 
of secondary traits with protein in order to maximize 
the gain for protein. Predicted responses in both pop- 
ulations indicate no practical advantages of the in- 
dexes over direct selection, although observed re- 
sponses in the F 4 of Population I were slightly greater 
for the index methods. In Population I both indexes 
chose the same 1 5 lines and 12 of the 1 5 lines selected 

Table 2. Fi estimates of phenotypic correlationst (above the 
diagonal), genotypic correlations (below the diagonal with 
standard errors), and heritabilityt (on the diagonal with 
standard errors). 





Protein 


Oil 


Protein 
+ oil 


Sugar 


Population I 








-0.59** 


Protein 


0.90 ±0.02 


-0.80** 


0.48** 


oa 


-0.83 ±0.03 


0.93 ±0.01 


0.14 


0.40** 


Protein + oil 


0.44 ±0.08 


0.13 ±0.09 


0.78 ±0.04 


-0.39** 


Sugar 


-0.68 ±0.07 


0.49 ±0.08 


-0.42±0.10_ 


0.72 ±0.05 


Population II 








-0.50** 


Protein 


0.75 ±0.03 


-0.65** 


0.36** 


oa 


-0.68 ±0.05 


0.71 ±0.03 


0.47** 


0.21** 


Protein + oil 


0.37 ±0.08 


0.43 ±0.07 


0.67 ±0.04 


-0.32** 


Sugar 


-0-60 ±0.06 


0.26 ±0.08 


-0.41 ±0.08 


0.67 ±0.04 



by direct selection. The observed decrease in oil ac- 
companying selection based on the indexes was — 1.30 
as compared to — 1.14 when based on protein alone. 
This trend might be expected since the indexes select 
directly against oil. Relative index weights for oil were 
considerably lower in Population II and direct selec- 
tion there chose the same 30 lines as did Index- 1 and 
29 of those chosen by Index-2. 

Increases in percentage protein resulting from di- 
rect selection or from an index designed to maximize 
gain in protein were accompanied by rather substan- 
tial observed decreases in oil. Several criteria were 
applied to the F 3 of Population I to compare them 
for their effectiveness in increasing protein while re- 
ducing or eliminating the accompanying decrease in 
oil (Table 4): 1) the sum of percentage protein + oil, 
as proposed by Thorne and Fehr (1970), 2) Index-3, 
a Smith-Hazel index which included protein and oil 
given equal economic weights, 3) Inaex-4, a Smith- 
Hazel index which included protein and oil given 
equal economic weights and sugar given an economic 
weight of zero, 4) Index-5, a desired J;ains index in 
which desired gains were set at 1.0% protein and 
0.25% oil, 5) Index-6, a desired gains index in which 
desired gains were set at 1.25% protein and -0.75% 
oil, 6) Inrtex-7, in which desired gains were set to 
increase protein as much as possible while holding 
the change in oil at zero, and 7) percentage protein 
among those lines that were greater than or equal to 
the population mean for percent oil. 

Table 3. Comparison of selection criteria in Fj designed to maxi- 
mize gain in percentage protein. 



Relative index weight 



F, response 



Selection 
criterion 


Protein % 


Oil% 


Sugar % 


Predicted Observed 










% Protein 


Population I 








1.64 


1. 2 


Protein 


1.0 


0 


0 


Index! 


1.0 


-0.154 


0 


1.65 


1.64 


Index-2 


1.0 


-0.175 


-0.184 


1.65 


1.64 


Population II 








0.88 




Protein 


1.0 


0 


0 




Index-1 


1.0 


-0.012 


0 


0.88 




Index-2 


1.0 


-0.031 


-0.172 


0.88 




Standard en 


ror 








0.09 



Table 4. Comparison of selected criteria in F$ of Population I de- 
signed to increase protein while controlling change in oiL 

P 4 response 



Selection 
criterion 



** Phenotypic correlation significant at the 0.01 probability level 
t Based on two replications. 



Protein 
Protein + oil 
Index-3 
Index -4 
Index-7 



Index-5 
Index-6 
Cull oU at 
mean 

Standard error 



Relative index weight 


Predicted 


Observed 


Protein %Oil% 


Sugar % Protein 


oa 


Protein Oil 




















% 


1.0 


0 


0 


1.64 


-1.32 


1.42 -1.14 


1.0 


1.0 


0 


0.68 


0.20 


0.15 0.19 


1.0 


1.002 


0 


0.67 


0.21 


0.23 0.15 


1.0 


1.009 


-0.041 


0.67 


0.21 


0.31 -0.01 


1.0 


0.916 


0. 




0 


0.45 -0.13 








Desired 


Observed 


1.0 


0.994 


0 


1.0 


0.25 


0.28 0.14 


1.0 


0.409 


0 


1.25 


-0.75 


1.31 -0.81 










0 


0.03 0.11 



0.09 



0.09 
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The selection indexes designed to maximize the 
gain in combined percentage protein and oil (Indexes 
-3 and -4) and that in which desired gains were pos- 
itive for the two traits (Index-5) were virtually iden- 
tical to selection for the sum of protein + oil. The- 
oretical expectations were much the same for these 
methods although there was some variation among 
observed results. Gains in percentage oil were usually 
comparable to predicted values but gains in protein 
fell far short of expectation. 

Indexes -3 and -4 were designed to maximize the 
gain in economic worth which, in this case, is the sum 
of gains in percentage oil and percentage protein since 
both traits are assigned equal economic values. Pre- 
dicted gains in economic worth for the two index 
methods and for selection for the sum of protein + 
oil were virtually identical. Observed gains were sig- 
nificantly different from zero (0.05 level) but fell short 
of predicted values due to the lack of response in 
protein mentioned above. Observed gains in eco- 
nomic worth ranged from 0.30 to 0.38, there being 
no significant difference among responses (0.05 level). 

Desired gains for Index-6, set at 1 .25% protein and 
— 0.75% oil, showed good agreement with the cor- 
responding observed changes of 1.31 and —0.81%. 
Index-7, designed to hold oil constant while increas- 
ing protein, seemed reasonably effective with an in- 
crease of 0.45% protein and a decrase of 0.13% oil. 
Although the achieved increase was only 32% of that 
resulting from selection for protein alone, the ac- 
companying decrease in oil was reduced to only 1 1 % 
of that which accompanied the direct selection. In- 
dex-7 seemed superior to selection for protein after 
culling for oil at the population mean, which resulted 
in little change for either trait. 

In summary, indexes using oil and sugar as sec- 
ondary traits designed to maximize gain in percent- 
age protein were not superior to direct selection for 
protein. Methods designed to increase both protein 
and oil showed limited success, although it appeared 
that slow progress in that direction could be made. 
The index methods using desired gains which at- 
tempted to hold oil constant or control its decrease 
while increasing protein seemed quite effective and 
were more effective than selection for protein fol- 
lowing culling oil at the mean. 

Although the desired gains indexes were effective 
in controlling changes in both oil and protein con- 



centration, identification of the best line from among 
a group of homozygous or near-homozygous lines as 
the end-product of selection could be done just as 
effectively by direct observation since heritabilities 
are high for both traits. However, a desired gains 
index could be of value in a recurrent selection pro- 
gram in which it is desired to identify a group of 
genotypes for recombination. 
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